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Abstract — Consider the following routing problem in the con- 
text of a large scale network G, with particular interest paid 
to power law networks, although our results do not assume a 
particular degree distribution. A small number of nodes want to 
exchange messages and are looking for short paths on G. These 
nodes do not have access to the topology of G but are allowed 
to crawl the network within a limited budget. Only crawlers 
whose sample paths cross are allowed to exchange topological 
information. In this work we study the use of random walks 
(RWs) to crawl G. We show that the ability of RWs to find short 
paths bears no relation to the paths that they take. Instead, it 
relies on two properties of RWs on power law networks: 

1) RWs ability observe a sizable fraction of the network 
edges; and 

2) an almost certainty that two distinct RW sample paths 
cross after a small percentage of the nodes have been 
visited. 

We show promising simulation results on several real world 
networks. 

I. Introduction 

Consider the following routing problem in the context of 
a large scale network G with n ^> 1 nodes, with particular 
interest paid to power law networks, although our results do 
not assume a particular degree distribution. A small set of 
nodes (h in total) want to exchange messages and are looking 
for short paths on G. We consider routing in two phases: 
Topology discovery and shortest paths calculation. In the 
topology discovery phase each node must crawl the network in 
order to partially discover the network topology. Nodes have 
limited crawling budget and only crawlers whose sample paths 
intersect are allowed to exchange topological information. 
Upon finishing the topology discovery phase nodes are allowed 
to optimally route on the sampled topology. 

In this work consider crawling G with h independent 
random walkers (RWs). We show that the ability of RWs to 
find short paths bears no relation to the path that they take. 
Instead, it relies on two RW properties on power law networks: 

• RW's ability observe a sizable fraction of the network 
edges; and 

• two RW sample paths (starting at different nodes) cross 
after a small percentage of the nodes have been visited, 
with high probability. 




Fig. 1 . Illustration of a RW sample path. Green nodes are the starting nodes, 
the blue nodes are nodes visited by the random walkers, and the purple edges 
are the edges used by the walkers to explore the graph which is a subset of 

E(B, i). 

We provide promissing simulations results on several real 
world networks. 

A. Framework 

In what follows we formalize the model. Consider an 
undirected graph G = (V, E) with n nodes and m edges. 
Start h random walkers from nodes U — {u\, . . . ,Uh} to 
explore the network. We assume each walker takes B steps. 
Each walker leaves bread crumbs so that one can trace its 
path back towards the initial node. Let X(t, i) — (xi, . . . , Xt), 
i = l,...,h, denote the sequence of nodes seen by the i- 
th walker at step t. Let S(t, i), i = 1, . . . , h, denote the set 
of nodes visited by the i-th walker at time t, i.e., S(t, i) 
is X(t,i) without repeated nodes. And let E(B,i) denote 
the set of edges belonging to the nodes visited by the i- 
th walker, S(B, i), after the walker budget B is exhausted. 
Upon exausting its budget RW i outputs X(B,i), S(B,i), 
and E(B, i). 

B. Naive Routing 

Note that if S(B, i) n S{B,j) ^ 0, i,j = 1, . . . , h, then 
walker i must have found walker j's breadcrumbs or vice- 
versa. Without loss of generality assume walker i found j's 
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Network 


nodes 


edges 


(k) 


(k 2 )-{k) 
(k) 


Flickr [12] 


1.7M 


31. 1M 


18.1 


943.4 


AS Net. [3] 


26.5K 


107K 


4 


279.24 


Livejournal [12] 


5.2M 


9.8M 


18.9 


154 


Enron [1] 


37K 


368K 


10 


140 


Gnutella [15] 


62.5K 


296K 


4.7 


10.6 


Power Grid [20] 


5K 


13K 


2.7 


2.9 



TABLE I 
Summary of networks 

breadcrumbs. A naive way to construct a path between Ui 
and uj is to route messages retracing all RW steps. Node m 
acomplishes this by retracing X(B, i) to contact one of the 
nodes in v e S(B,i) n S(B,j). Once the message reaches 
node v, it reaches its final destination, node Uj, following 
j's breadcrumbs starting from v, i.e., the message follows the 
reverse RW path from Uj to v. The breadcrumb is installed 
only at the first visit to a node, which guarantees that the 
reverse RW path is without loops. 

C. Drawbacks of Naive Routing 

The naive algorithm presented above is quite inefficient. 
Random walks are not particularly good at finding short paths 
(as seen in Section V); unless there are few paths between 
pairs of nodes, e.g., G is a tree. A random walk with erased 
loops, also known as a loop-erased random walk, samples all 
possible spanning trees of G with equal probability [17]. This 
can lead to long paths lengths between m and Uj. 

D. The True Power of RWs in Finding Short Paths 

The ability of RWs to find short paths bears no relation to 
the paths that they take. Instead, it relies on RW properties 
found under conditions often present in power law networks. 
We summarize the main results of Section II here: 

. \E(B,i)\ w < fc2 )-( fc ) B, where B«n,t = l,...,/i, and 
(k) and (fc 2 ) are the first and second moment of the node 
degrees in original graph G, respectively. 
• Under mild conditions two RW sample paths cross, with 
high probability. 
These observations motivate our use of RWs to partially 
uncover the topology of G. 

E. Outline 

The outline of this work is as follows. Section II charac- 
terizes the subgraph obtained by the i-th random walker and 
the probability that two RW sample paths cross. Section III 
presents the Random Walk Short Path (RWSP) algorithm. 
Section IV presents our results when simulating RWSP on 
real world networks. Finally, Section V presents a discussion 
and related work. 

II. RW Subgraph Properties 

In this section we show that the real power of a random 
walk resides in its ability to collect edges from G. Let E(B, i) 
denote the set of edges belonging to the nodes visited by the 
i-th walker, S(B, i), after the walker budget B is exhausted. 



We see that RW i, i = 1, . . . , h, builds an edge rich subgraph 
G'(B,i) = (V, E(B, •)) with budget B = /3n in a network 
with large degree variance. In what follows we assume n^> 1, 
B <C n, and that RW i has access to the neighbor list of 
each visited node, such that E(B,i) is obtained without extra 
sampling costs. 

These are the main results of this section: 
. \S(B, -)| w B, i.e., the set of nodes visited by the RW 
scales linearly with B. The RW tends to collect only new 
nodes at the beginning of the walk. 
. \E(B, i)\ w {k2) ( ~ {k) B, where (fc) and (fc 2 ) are the first 
and second moment of the node degrees in original graph 
G, respectively. 
• We provide conditions under which there is a RWSP 
path between any pair of nodes (ui,Uj), Vi, j, with high 
probability. 

In power law networks q = ((fc 2 ) — (fc))/(fc) is likely to be 
quite large, and thus G' spans a sizable fraction of G edges 
even when B is a small fraction of the total number of nodes. 
For instance, in the Flickr social photo sharing website q = 
943.4, in the Enron email network this value is q = 139, and 
in the Livejournal blog posting social network this value is 
q = 154. Table I presents values of q for several real world 
networks. 

A. The RW subgraph G'(B,i) contains a sizable fraction of 
the edges of G. 

In what follows we find a closed approximation formula 
that gives the number of edges in G'(t, i), t < B. Let S(t, i) 
denote the set of nodes visited by the i-th walker at time step t, 
i = 1, . . . , h. As we are interested in large graphs, we assume 
n » 1. Let (fc) and (fc 2 ) be the first and second moment 
of the node degrees in the original graph G, respectively. Let 
t = t f 71 and n e (r) = E [\E(rn, be the average number of 
edges in S(rn, i), where E[-] is the expectation operator. We 
provide a closed form mean field approximation (Theorem 2. 1 
in Section II-C) for n e (t) when t < (fc) 2 /(fc 2 ) 



n e (r) = 2m I 1 — exp 



(fc 2 )-(fc). 



(1) 



and 



E[\S(rn,i)\} = T^ 



m ( ( 



(k 2 ) ~ (k) 
(fc) 2 



Figures 2 and 3 compare our mean field approximation 
in (1) with simulation results for the Gnutella and Flickr 
networks. The Gnutella network cannot be considered a power 
law network as its maximum degree is quite small (one 
hundred), but our approximation works for any degree dis- 
tribution not only for power laws. Please refer to Table I for 
characteristics of these two networks. 

To see how n e (r) grows with B = f3n let's decompose (1) 
into its Taylor series expansion 
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Fig. 2. n e (r) in the Gnutella network. This graph shows the fraction of 
edges in S(t, •) against t = t/n m the Gnutella network. The simulation 
results of 10 runs are shown in a scatter plot of red crosses. The mean field 
approximation in equation (1) is shown as a green line. In the Gnutella network 
((k 2 ) — (k))/(k) = 10.6, which is not as large as in a "true power law" 
network. 
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Fig. 3. n e (r) in the Flickr network. Same as Figure 2 over the Flickr 
network. In the Flickr network ((A: 2 ) - (k))/{k) = 943.4. 



Eq. (2) yields for (3 < 1 

n e (P) f 



(fc 2 ) ~ (fc) 
(k) 



f3n. 



(3) 



In a power law network in the asymptotic regime n — > oo, 
i.e., if k is the node degree then it has degree distribution 



p k oc (a - l)k a . 



(4) 



For power law degree distributions the approximation in (3) 
yields 

oo if a < 3 

(3n{a-i)' 1 ifa>3. 



n e {P) 



While n e (f3) 7^ oo in any real world network, the above 
equation provides an invaluable hint that n e (f3) can be quite 
large in real life. Indeed, in the Flickr network we observe 

n flickr^ _ 943 4 x lj700; 000 x & edges, 

which means that by the end of the crawling the walker 
collects on average 943.4/3n distinct edges. Table I shows the 



value of q = ((k 2 ) — (k))/(k) for several real world networks. 
Before we proceed to the proof of equation (1), we switch 
gears and show that under certain conditions a RW starting at 
itj, i = 1, . . . , h, visits a node that is also visited by RW j, 
Vj 7^ i, with high probability. 

B. With high probability two RW sample paths cross. 

In what follows we show that under certain conditions the 
RW sample paths starting at nodes itj and Uj, Vi,j, cross 
with high probability. Assume, without loss of generality, that 
RW i completes before we start RW j. The intuition behind 
our result is that for sufficiently large \E(B,i)\/2m, S(B,i) 
functions as an attractor for the j-th RW, such that with high 
probability RW j hits the set S(B, i) before B steps. The 
above reasoning, however, cannot be applied to a RW that has 
a long transient; for instance a RW on a graph shaped like 
dumbbell likely will not satisfy our conditions. 

We are interested in the hitting time of RW j into the set 



S(B, i). Let X = {X[ j) 
by RW j. Then 

P[Xr\S(B,i) = 



} be the set of nodes visited 



denotes the probability that RW j and RW i do not meet. 
Unfortunately finding the hitting time to a set of nodes in a 
general graph is a hard problem. Solutions based on spectral 
methods (e.g., Chau and Basu [5, Theorem 1]) do not yield 
simple expressions for the general case. A more detailed 
discussion is found on Section V. Hence, we consider the 
following approximation. Let B = f3n, n 3> 1, and 

be the fraction of edges that belong to nodes in S(B,i), i.e., 
nodes visited by the i-th RW. Note that j(B,i) is the steady 
state probability that RW j visits a node in S(B, i) at any 
given time step. As n > 1 the average value of j(B, i) is 

MP) 



^)=E[ 1 {B,i)] 



2m 



Let A„ = o(n) and assume Vw e {1,B— A} that for some 
(possibly small) constant c > 0, the following inequality holds 



P[X U) 



w+A 7 



G S(J3n,i)\X& $ S(pn,i)} > &f(p). (5) 

That is, if RW j is not at S(B, i) at time w then at time w + A 
it will be at S(B, i) with probability that is at least a constant 
factor of 7(/3). In a graph with o(n) mixing time c w 1 and 
the inequality in (5) becomes an equality. However, sublinear 
mixing time is a sufficient but not necessary condition in (5), 
which prompts us to conjecture that (5) holds for a broader 
class of graphs. Note that 

P[x n s(B, i) = 0] < P[{x u X A+1 , . . . } n S(B, i) = 0] 

as the probability that RW j at steps 1, A + 1, . . . does not hit 
S(B,i) is higher than the probability that RW j at all steps 
1,2, ... ,B does not hit S(B, i). Application of (5) and the 
strong Markov property yields 

P[X n S(B, i) = 0] < (1 - cy(P)) [Pn/Ai . 



(6) 
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From (1), cry(P) > 0. As n > 1, implies [/3n/A\ > 1 we 
can put it all together into (6), yielding, for e<l, 

P[Xr\S(B,i) = 9\ < e. 

Thus we conclude that if the condition states in (5) is satisfied 
then walkers i and j must meet with high probability. In what 
follows we provide a proof for (1). 

C. Mean field analysis of the number of edges in G'(t,i). 

Theorem 2.1: Let r = t/n < (k) 2 /(k 2 ) and let n e (r) = 
£7 [|£(rn, i)|] be the average number of edges in S(rn, •). 
Then 



n e (r) = 2m ( 1 — exp 



(fc 2 ) - (fc) 
(fc) 2 



Proof: By following an edge, a RW reaches a node v 
with degree fc„ with probability proportional to k v . Node v has 
average degree higher than the average degree in the graph. 
This is a classical case of the inspection paradox and therefore 
E[k v ] = (k 2 )/(k). If v £ S(t, i) we are adding at most k v - 1 
edges to E(t, i), (the minus one is a consequence of the need 
to remove the edge the RW followed to reach v which was 
already in E(t, i)). If G was a random configuration graph, 
as t <C n and n» 1, the probability that any of the k v — 1 
edges are already in E(t, i) is negligible. Thus, by adding v 
we are adding q = E[k v ] — 1 edges to E(t,i). 

Assuming the RW is sampling edges uniformly at random, 
we can write the following expression for n e 



n e (r + l/n) - n e (r) = qP[v # S(t, i)]=q[l- 



2m 



where n e (r)/2m is the probability that a randomly sampled 
edge is in E(t, i). Multiplying both sides of the above equation 
by n and letting n — > oo yields 



n e (r + l/n) - n e (r) / n e (r 

= nq 1 



l/n H V 2 ™ 

dnJr) q ,„ , 
T=(f) (2m_ne(T)) 

with the boundary condition n e (Q) = 0. The solution to the 
above differential equation is 

n e (r) =2m(l-e- qT l^), 

concluding the proof. ■ 

Lemma 2.2: Let r = t/n < (k) 2 /(k 2 ). Then 



E[|5(Tn,i) 



(fc 2 ) - (fc) 



1 — exp 



(fc 2 ) ~ (fc) 
(fc) 2 



Proof: The proof of follows from Lemma 2.1 with 
S[|S(rn,i)|]=n e (r)/«. ■ 



III. The Random Walk Short Path (RWSP) 
Algorithm 

The RWSP algorithm is as follows: Initialize W{i) = and 
C(i) = 0, i = l,...,h. 

1) Start walker i at node Ui £ U with budget B, i — 
l,...,h. 

2) For each walker i = 1, . . . , h: 

a) At time step t update the "explored" graph G'(t, i) 
as new nodes are visited and new edges are dis- 
covered. 

b) If the t-th visited node, x t , has already been visited 
by a set of walkers W £ {1, . . . , h} w ; 

i) Assign W(i) <r- W(i) U W; 

ii) Assign C(i) <- C(i) U {x t }\ 

Ui sends a message to uj advertising that node u% 
just found x t (sending this last message requires 
tracing back the path x t to Uj using fs bread- 
crumbs), Vj G W. 

c) Stop when t = B. 

3) At the end of walk send G'(B, i) to Uj, Vj e VK(«), via 
the nodes in C(i) that have been visited by walker j. 

4) Upon receiving (j, v), v e V, from walker j ^ i: assign 
W(i) <- W{i) U {j} and C(i) <- C(i) U {v}. 

5) Let G*(i) = D reW{i)VJ{ i}G'(B,j) denote the union 
of all subgraphs known to ttj. Compute the minimum 
spanning tree T(i) on G*(i) starting from Uj. 

Node uses T(z) to route its messages to nodes Uj on G, 
Vj € W(i). Figure 1 shows a snapshot of the algorithm. In 
what follows we present our simulation results. 

IV. Results 

This section presents our results when simulating RWSP on 
real world networks. Table I summarizes the datasets used in 
our simulations. Figure 4a show the fraction of RW shortest 
path lengths on G* defined in Section III (X axis) between h = 
4 nodes vs. the true shortest path lengths (Y axis) [blue/gray 
matrix]. We choose h = 4 walkers because as we increase fc 
our results get better if B is kept constant. We also evaluate 
different values of h, h € {2,6,8, 16}, rescaling B such that 
for a constant f3 > 0, B = fin/h. The results for different 
values of h are similar to those presented here, thus due to 
space constraints we omit them. 

In Figure 4a the yellow bars next to the Y axis show the 
true all pairs shortest path length distribution. For instance, we 
in LiveJournal's most of the shortest paths are between 5 and 
6 hops. The RW sampling budget is B = 2.5% of the nodes 
in the graph. A total of 10, 000 runs were used to produce the 
averages seen in the graph. The axis marking INF refers to 
the fraction of times that a node could not reach another node 
(due to disconnected components). In all runs the RWSP was 
able to find nodes in the same component (we condition the 
RWSP to start in the giant component). We also tested RWSP 
over other networks with similar parameters. 

Our results are very promising. For Livejournal (Figure 4a) 
more than 60% of the RWSP paths of any given length are the 




(a) LiveJournal (h = 4, B = 0.025 n) 




(c) Gnutella (h = 4, B = 0.05 n) 




(e) Enron (h = 16, B = 0.0125 n) 
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(b) Flickr (/i = 4, B = 0.025 n) 




(f) Power Grid (h = 4, B = 0.05 n) 



Fig. 4. RWSP vs. Shortest Paths, h walkers and sampling budget B. Fraction of RW shortest path lengths (X axis) between nodes vs. true shortest path 
lengths (Y axis) [blue/gray matrix]. Yellow bars next to Y axis show true shortest path lengths distribution. The axis marking INF refers the fraction of times 
that a node could not reach another node (due to disconnected components). 



6 



shortest paths. Moreover, more than 90% of the RWSP paths 
of any given length are within one hop of the shortest paths. 

We also test RWSP on Flickr, AS graph, a snapshot of the 
Gnutella network, Enron email dataset with similar results. 
On Flickr (Figure 4b) (B = 0.0125 n) more than 85% of the 
RWSP paths of any given length are the shortest paths; also, 
all paths are within 2 hops of the shortest paths. On Gnutella 
(Figure 4c) (B = 0.0125 n) network we observe more than 
60% of RWSP paths of all lengths to be within 2 hops of 
the shortest paths. For the AS graph (Figure 4d) (RWSP with 
B = 0.05 n), more than 65% of RWSP paths of all lengths 
are the shortest paths; the majority (> 95%) of RWSP paths 
now counting over all lengths are the shortest paths. On Enron 
(Figure 4e) (B — 0.05 n) we have a result as good as the one 
for the AS graph. 

Figure 4f shows that the only graph in which the RWSP 
does not perform well, the Power Grid network. The Power 
Grid network is not a power law graph and has the largest 
diameter of any of the previous networks, thus, we expected 
RWSP to perform poorly. Consulting Table I we observe that 
((fc 2 ) - (fc))/((fc)) = 2.9 is small and thus G' spans only a 
small fraction of the original graph, making the task of finding 
a long path close to impossible. 

V. Discussion & Related Work 

The problem of topology discovery has received signifi- 
cant attention in the (communications) networking literature. 
Broadly, there are two types of topology discovery - proactive 
and on-demand. In proactive "link-state" routing schemes such 
as OLSR [7], each node floods the state of its neighborhood to 
the entire network. Since each flood costs 0(m) forwardings, 
where a network has n nodes and m = 0(n 2 ) edges, the 
total overhead of flooding the network is 0(mn) which is 
0(n 3 ) for dense networks. Variants of link-state routing such 
as HSLS [16] reduce this overhead in dynamic networks to 
C>(n 5 / 2 ) (amortized over time) by restricting the frequency of 
flooding the link-state updates to nodes located farther away; 
or they optimize the flooding proceeding by using Multi Point 
Relays [9]. These schemes are advantageous when the likely 
traffic patterns are all-to-all, i.e., every node is expected to 
transmit; hence, the cost of topology updates is amortized over 
all the nodes. 

However, often only a small number h of nodes in the 
network need to transmit or receive messages to/from each 
other. In these situations, on-demand routing schemes are 
preferred since they attempt to gather topology state when they 
need to communicate [8], [13]. This is typically achieved by 
means of flooding the network with a "route request" message. 
When the intended recipient receives this message, it replies 
with a "route response" message along the reverse path to the 
source (this is unicast). In the worst case, this takes hO{m) 
forwardings in dense networks. 

Opportunistic routing schemes, in contrast, do not at- 
tempt to discover the topology but rather make routing 
decisions adaptively based on the actual transmission out- 
comes. The back-pressure algorithm pioneered by Tassiulas 



and Ephremides [19] explores and exploits all feasible paths 
between each source and destination to achieve throughput- 
optimal routes. Ant colony optimization is a meta-heuristic 
technique that uses artificial ants (specially marked packets) 
to build pheromone trails in the network [4]. 

Random walks have also been used to deliver messages but 
suffer from excessive delays due to long hitting times [18], 
[11]. Hitting times of random walks have been well studied. 
For a large K x K torus, as K — > oo, the maximum hitting 
time between two nodes is H max = Q(K 2 log K). We showed 
in our previous work [6] that for network topologies with node 
degrees concentrated around the mean r, H max w " 1 °f " . A 
corollary to the above is applicable to random geometric net- 
works at the critical radius [14]; in that regime, r w logn, and 
therefore, H max = 0(n). Although in the worst case hitting 
times could have a 0(n 3 ) scaling law [10] (for "dumbbell" 
shaped topologies), in most commonly occurring topologies, 
the scaling properties of H max reside between 0(n) and 
0(n 2 ). Moreover, the hitting time to a subgraph drops sharply 
when the size of the subgraph increases in comparison to the 
hitting time to a specific node [2]. Thus, this could yield lower 
overhead than on demand protocols described above, albeit at 
the cost of a higher routing stretch (the on demand routing 
protocols return optimal shortest paths). 
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